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1.  PRELIMINARY  CONCEPTS 

1.1.  Introduction 

The  subject  matter  of  this  work  is  minimsx  parameter  estimation. 

In  the  present  chapter  we  will  introduce  some  basic  concepts  and  definitions 
from  the  theory  of  parameter  estimation  and  through  these  arrive  at  the 
results  and  techniques  which  we  will  use  to  search  for  minimax  solutions  to 
parameter  estimation  problems. 
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1.2.  Parameter  Estimation  Theory 

In  general  a parameter  estimation  problem  can  be  stated  as  follows: 
Let  Z be  a random  variable  having  a probability  distribution  function 
Fq (Z) . The  subscript  in  the  distribution  function  indicates  that  it  depends 
on  an  unknown  parameter  9.  In  keeping  with  the  standard  literature  on  the 
subject  a canonical  personage  usually  identified  as  the  statistician  or 
decision  maker  is  introduced  to  facilitate  the  exposition  of  the  problem. 

The  statistician  is  interested  in  estimating  the  unknown  parameter  9 from  a 
single  observation  of  the  random  variable  Z.  More  specifically  a function 
b,  called  a decision  rule  or  an  estimator,  is  to  be  designed  so  that  the 
random  variable  S(Z)  provides  the  statistician  with  a "good  guess"  or 
estimate  of  the  value  of  the  unknown  parameter  0.  At  this  point  we  must 
address  the  following  problem:  How  shall  we  evaluate  the  goodness  of  a 

given  estimate?  Thi?  1*3  the  question  of  choosing  a criterion  for  estimation. 
In  this  work  we  shall  consider  the  loss  function  approach  to  parameter 
estimation  and  apply  the  nrairaax  criterion.  This  approach  and  criterion  are 
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Definition  3:  Two  decision  rules  are  equivalent  if  they  have  the  same 

conditional  risk  function. 

Definition  4:  A decision  rule  b is  said  to  be  a Bayes  rule  if  for  every 

other  rule  y 


E{r[6,0]}  < E{r[V,0]3  (1.2.4) 

where  the  average  is  taken  over  a prior  distribution  X(8)  for  9,  that  is,  6 
minimizes  the  average  risk. 

Definition  5:  o*  is  said  to  be  a minimax  decision  rule  if  for  all  other 

rules  5 


sup  r[6*,8]  < sup  r[6,8].  (1.2.5) 

ego  e^o 


That  is  6*  infimizes  sup  r[6,9]. 

901 

1 .3  The  Relation  of  Game  Theory  to  Minimax  Estimation 

Minimax  estimation  theory  is  closely  related  to  the  theory  of  two- 
pers.n  zero-sum  games.  The  elements  of  a two-person  zero-sum  game  are: 

(i)  The  two  players,  usually  identified  as  the  Statistician  and 

Nature . 

(ii)  The  Kernel  that  represents  the  pay-offs  of  the  game. 

The  Kernel  is  a function  of  two  variables  each  of  which  is  controlled  by  one 
of  the  players.  When  a minimax  estimation  problem  is  considered  as  a game, 
the  kernel  of  the  game  is  identified  with  the  average  risk 


a 
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i 

J * 


J r[6,6]  dX  (8 ) . 

n 


(1.3.1) 


The  two  variables  are  the  decision  rule  6 controlled  by  the  Statistician  and 
the  distribution  X (9)  controlled  by  Nature. 

The  rules  of  the  game  are  such  that  the  Statistician  looks  for  a 
decision  rule  6 that  minimizes 


Sup  J*  r[6 ,0]  dX  (8  ) 

0 


while  Nature  tries  to  find  a distribution  X(6)  that  maximizes 

inf  J*  r[6,8]  dX(8). 

6 n 

Such  a distribution  is  called  a least  favorable  prior  distribution. 
It  can  be  seen  that 


(1.3.2) 


(1.3.3) 


Sup  j r[6,9]  dX(8)  = Sup  r[6,8], 
X fi  8 €fi 


(1.3.4) 


so  that  if  the  Statistician  succeeds  in  the  game  he  has  found  a minimax 
solution  for  the  estimation  problem. 

The  fundamental  theorem  in  game  theory,  the  minimax  theorem,  states 
that  jr  certain  conditions 


inf  sup  f r[6,0]  dX.(0)  = Sup  inf  J r[6,8]  dX(0).  (1.3.5) 

6 X b * 6 n 

Now  if  the  minimax  theorem  holds  and  a least  favorable  prior  distribution 
exists,  then  any  minimax  rule  is  Bayes  with  respect  to  this  prior 
distribution.  See  [1]  for  an  extended  discussion  of  Bayesian  decision  theory 


and  the  minimax  criterion. 


L>  mma  1 : A Bayes  rule  is  admissible  if  it  is  unique  up  to  equivalence. 

Proof : (see  [1],  p.  60). 

Lemma  2 : (Wald).  Let  6^  be  a Bayes  rule  with  respect  to  r[6,0]  and  X(0). 

Assjme  that  max  r[6i,  ,0]  = r exists.  Define 

een 

= {egn  : r[6^,0]  = r}.  (1.4. 

Assume  that  X[t^.]  =1.  We  now  have: 

(i)  6^  is  rainimax 

(ii)  X is  a least  favorable  prior  distribution. 

Proof:  (i)  (by  contradiction).  Assume  there  exists  a decision  rule  6,  and 

a number  c > 0 such  that 

max  {r[6-0]+c}  < max  r[6^  ,0]  (1.4.; 

0€n  een 

i j.1  then 

j {r[6  ,0] +c}dX  (9  ) < max{r[&,9] 4 c} 

0 (1.4.: 

< max  r[6i,  ,0] 

9€ft 


«n&<  r[6,  ,0]  = r = J r[6x,9]  dX(9), 

061  a 

Therefore 


since  X assigns  probability  1 to 


j r[6 ,0]  dX(9)  +c  < j r[6.  ,0]dX  . 


(1.4.* 


f MriMUtf 
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This  last  inequality  contradicts  the  assumption  that  6^  is  a Bayes  rule  and 
we  therefore  conclude  that  6^  is  minimax. 

(ii)  (By  contradiction) 

Assume  there  exists  another  prior  probability  measure  p and  a number  C > 0 
such  that 


inf  J r[6,e]  dX(9)+c  < inf  J r[6,9]  dp- (9) 

6 n 6 fi 


(1.4.5) 


J r[5x,6]  dp, (9)  > inf  J r[6,9]  dp.(9) 

n 6 n 


> inf  J r[6,9]  dX(e)+c  = r+c 

6 n 


so  that 

J r[6,  ,0]  dp, (9) > r+c  > r 
Q X 

which  is  a contradiction;  therefore  X is  a least  favorable  prior 
distribution. 


(1.4.6) 


(1.4.7) 
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2.  MINIMAX  PARAMETER  ESTIMATION  UNDER  GENERALIZED  QUADRATIC 
LOSS  WITH  A COMPACT  PARAMETER  SPACE 


2.1.  Introduction 

The  estimation  problem  we  shall  consider  in  this  chapter  belongs 
to  a general  class  of  problems  that  can  be  described  by  the  following  model: 

Let  z = 0 + v where  0 €Q,  ft  is  some  specified  parameter  space 
contained  in  En,  and  v is  a zero-mean  Gaussian  random  vector  with  known 
positive  definite  covariance  matrix  E.  We  want  to  find  a minimax  estimator 
for  the  unknown  parameter  0.  It  is  known  [2,3,4]  that  the  existence  of 
minimax  estimators  and  the  structure  of  these  estimators  when  they  exist  is 
strongly  dependent  on  the  structure  of  the  problem  which  is  affected  by  the 
underlying  loss  function,  the  parameter  space  Q and  the  class  of  allowed 
decision  rules. 

When  the  loss  function  is  the  Standard  Quadratic  Loss  function, 
Lct[6(z),0]  = [6(z> -0]'c[6(z) -0]  , 

and  the  parameter  space  Q is  En,  a minimax  solution  for  this  problem  is 
6(z)  = z;  see  [1],  p.  170  for  further  details. 

The  standard  quadratic  loss  function  is  generalized  by  adding  to 
it  a quadratic  term  In  0 obtaining 

L^SCz),©]  = [6  (z)  -0]'C[6(z)  -0]  - 0'D0. 

This  loss  function  forces  Nature  to  compromise  between  a gain  due  to 
increasing  the  Statistician’s  estimation  error  and  a loss  which  is  equal 
to  the  square  of  the  norm  weighted  by  D.  For  an  unconstrained  parameter 
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space,  i.e.  0 * En,  this  problem  was  solved  by  Basar  and  Mintz 

[2]. 

The  parameter  6 can  be  restricted  to  reside  within  a known 
compact  subset  of  En,  for  example  a hyperellipsoid  defined  by 

Q={0€En:  0'G0  < e2,  e2  > 0,  G > 0} . 

In  this  case  a linear  minimax  estimator  for  0 under  the  standard  quadratic 
loss,  function  was  found  by  Mintz  [3].  We  will  be  considering  here  the  problem 
of  finding  a linear  minimax  estimator  under  a generalized  quadratic  loss 
function  and  will  constrain  the  parameter  0 to  reside  within  a known 
hyperellipsoid  in  E°. 


2.2.  Problem  Statement 

Let  z = 0 +v  where  v is  a normal  zero -mean  random  vector  with 
known  positive  definite  covariance  matrix  Z and  0 is  an  unknown  element  of 
a known  hyperellipsoidal  subset  of  E*1  denoted  by  Q and  defined  by 

fi  = {0  € En  : 0 'D20  < e2;  e2  > 0,  D2  > 0}  (2.2.1) 

We  wish  to  determine  a minimax  estimate  5(z)  for  0 with  respect  to  the 
generalized  quadratic  loss  function 

L[8(z),0]  = [6<z)-0]'c[6(z) -0] -0'D.a0  (2.2.2) 

where  C > 0 and  > 0,  subject  to  the  restriction  that  6 be  a linear  rule. 
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2.3.  Solution 

Define 

r (A, Of)  - (A+S)"1SCE(A+I:)"1  -Dj,  -aD2 


with 


A > 0 and  a > 0. 


Let  A and  a denote  any  solution  to  the  matrix  equation: 


r (A*, a*) A*  * 0 

sach  that 

r (A*, or*)  < 0 

and 

Tr[D2A*]  < e2 . 


(2.3.1) 


(2.3.2) 


(2.3.3) 


(2.3.4) 


(2.3.5) 


Theorem  I: 

(i)  If  T (A, or)  is  defined  by  (2.3.1)  and  (2.3.2)  then  there  exists  a 
lovariance  matrix  A*  and  a scalar  Of*  > 0 that  satisfy  (2.3.3)  and  (2,3,4); 
and 

•-ithet  (a)  a*  = 0 and  TrfD^A*]  = e2  < e2 

or  (b)  Of*  > 0 and  Tr[D„A*]  * e2  * e~. 

2 * 

(li)  If  6(z)  is  defined  by 

6*(z)  * A*(A*+v)“12  (2.3.6) 


then  6*(z)  is  a linear  minimax  estimate  for  9.  A proof  of  part  (i)  of  the 
main  theorem  may  be  obtained  by  generalizing  the  proof  that  appears  in 
Appendix  I of  (3).  The  essential  modification  of  that  result  needed  to 
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show  part  (i)  is  obtained  by  replacing  J(A,A)  as  defined  in  Lemma  2 of  [3] 

fcy 

J(A,A)  = Tr{[(l-A)'C(I-A) -D^A}  + Tr[CAEA'].  (2.3.7) 

We  note  at  this  point  that  the  matrix  of  this  present  paper  corresponds 
to  D of  reference  [3].  Since  the  modified  proof  is  very  similar  a complete 
discussion  will  be  omitted.  We  also  note  that  the  last  two  assertions  in 
Appendix  I of  [3]  do  not  apply  in  the  present  problem  due  to  the  difference 
in  the  structure  of  the  loss  function. 

A proof  of  part  (ii)  of  Theorem  I is  obtained  using  Lemma  2 of 
Section  1.4.  We  need  a prior  probability  distribution  for  0 which  we  define 
as  follows: 

2 2 

Assume  Tr[D-A]  = e < e , where  A > 0 and  consider  the  matrix 

i * — — 

h h h 

D|Ad’,  where  D2  is  the  unique  symmetric  positive  definite  square  root  of 

v i 

^>2 • Assume  T diagonalizes  D2^D2*  tlmt  is 

S = T'D^Ad^  T * diag(s1,...,sn).  (2,3.8) 

Lft  m denote  an  n-dimensional  random  vector  with  independent  components 
denoted  by  m^  , i = l,...,n  whose  distribution  is  defined  by: 

- sj}  = , Pr{mi  = -<sj)}  = %.  (2.3.9) 

It 

The  random  vector  m takes  on  2 values  with  equal  probability,  where  k is 
the  rank  of  A.  Let  m<[A]  denote  the  probability  measure  defined  by  the 
distribution  of  the  random  vector  y = D2^Tm.  We  note  that  the  support  of 
the  measure  p.[A]  lies  entirely  in  the  boundary  of*  Q*,  a hyperellipsoidal 
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1 

3 

I 

I 

subset  of  En  contained  in  Q and  defined  by  j 


I 


r 


! 


i. 

I 

1 


fi*  = {y  6 En  : yr^y  < e^  < e2} . (2.3.10) 

k 

This  assertion  follows  from  the  fact  that  for  each  of  the  2 equally  likely 

-h 

values  of  m the  vector  y = D2  Tm  lies  on  the  boundary  of  Q*  since 
y'Dy  = m'T'D’^D^Tm  = m'm 

= 2 at  = Tr[ I>2  A D2]  = Tr[D2Al  = e2  < e2.  (2.3.11) 

Further,  we  note  that  E[y]  = 0 and 

-k  -k  -k 

E[yy ']  = E[D2*T  mm' T'D2  ] = D ^TST'D^ 

= D^TT'  dJad^T'  TD^ 

* A . (2.3. lI) 

No.',  let  X denote  a prior  probability  distribution  for  9 such  that 
6 ~ u[A].  For  this  prior  distribution  a Bayes  estimate  is  given  by 

6,  (z)  = A(A  +E)_1z.  (2.3.13) 

Xo 

This  follows  from  the  fact  that  the  loss  function  is  quadratic  in  the 
estimation  error  and  therefore  within  the  class  of  linear  rules  for  a prior 
distribution  with  covariance  A the  rule  defined  by  (2.3.13)  minimizes  the 
average  risk.  In  what  follows  we  will  show  that  X = p,[A  ] and  6 (z) 
defined  by  (2.3.6)  satisfy  the  conditions  stated  in  Lemma  2 Section  1.4 


and  thus  establish  part  (ii)  of  the  main  theorem.  We  need  an  expression 
for  the  conditional  risk  function: 
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r[6x  ,e]  = e{l[6.  (z),e-]|e} 

o A0 

= e'UA+ErtcS^A+E)"1  -D1]e 
+ :r[(A+E)“1AcA(A+S)"1I]  . (2.3.14) 

The  fact  that  X*  and  6*(z)  satisfy  Lemma  2 of  Section  1.4  is  established 
in  several  steps  as  follows: 

Step  1:  Observe  that  for  all  6 of  the  form  6(z)  s A(A+£)  ^z.  A > 0 the 

maximum  of  the  conditional  risk  function 

F = max  r[6,e] 

exists  since  r is  continuous  in  0 and  Q is  compact. 

Step  2:  First  consider  the  case  when  the  solution  to  (2.3.3)  and  (2.3.4) 

satisfy  condition  (a)  of  part  (i)  of  the  theorem.  In  this  case  we  note  that 
the  conditional  risk  function  can  be  written  as: 

r[S*,9]  = 9 T (A*»O)0  + Tr[ (A*  +£)”1A*CA*(A*  +Z)-1Z],  (2.3.15) 

then  (2.3.4)  implies  that  T^*»  defined  by 

= {9  € ^ : r[S*,0]  = r}  , 

coincides  with  the  null-space  of  r(^*,0),  that  is 

Tx*  = N[r(A*,0)]. 


(2.3.16) 


(Vlipppa|liHiip9ppMl9i^mmvn,<' '» 'M' jm,>'|mii  i«* 
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When  case  (b)  of  part  (i)  is  in  force  the  conditional  risk 
function  can  be  written  as 

r[5*,e]  = 9T(A*,o*)0  + a*e'D2e 

+ Tr[(A*+£)'VcA*(A*+E)"V|.  (2.3.17) 

It  is  obvious  that  for  this  case 

TX*  « N[r(A*,er*)]rtaO  (2.3.18) 

where  90  - {0€E°  : 0'D28  = e2}  . We  next  prove  that  in  cases  (a)  and  (b) 

x*0x*]  - i. 

Step  3:  In  both  cases  considered  in  step  2,  is  a subset  of  the  null- 

space  of  r(A*,a*).  We  will  now  prove  that  X*  assigns  probability  one  to 
NjT  (A*, <**)].  From  (2.3.4)  we  conclude  that  0,r(A*,o'*)0  < 0 but 

E(eT(A*,a*)8]  * E{Tr[0T(A*,n*)0]} 

= E{Tr[r(A*,a*)00']3 

= Tr[r (A*,a*)A*3  (2.3.19) 

and  (2.3.3)  implies  that 

Tr[r  (A*,a*)A*]  = 0 . (2.3.20) 

So  we  have 

0T(A*,a*)e<O  (2.3.21a) 


and 


E[0T(A*,a*)0]  * 0. 


(2.3.21b) 
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This  allows  us  to  conclude  that  X*  assigns  probability  one  to  the  null- 
space  of  T A*, a*).  For  the  case  where  A*  and  <**  satisfy  (a)  in  part  (i) 


this  proves  that  X*[t.  *]  = 1. 

A. 


For  the  situation  in  which  case  (b)  of  part  (i)  is  in  force,  we 
note  that  the  discussion  following  (2.3.10)  establishes  that  X*  « M-(A*) 
assigns  probability  one  to  the  boundary  of  Q*,  which  coincides  with  Q in 


O 

this  situation,  since  Tr[Dj\*]  * e . Hence 


X*[50]  = 1 and  X*[N(T (A*, a*))]  = 1 


and  therefore  we  conclude  that  t.*  as  defined  by  (2.3.18)  receives 

K 

probability  one  from  X*.  We  note  at  this  point  that  we  have  a prior 


probability  distribution  X*,  a Bayes  estimator  6*(z)  with  respect  to 


X*  and  r[5,8]  and  X*  assigns  probability  one  to  the  set  where  r[6*,0] 


attains  its  maximum  value,  i.e.  r.  Making  use  of  Lemma  2 of  Section  1.4 
vc-  conclude  that 


-1 


5*  - A*  (A  + E)  z 


~-n 

gi 

# 

i 


is  a linear  minimax  estimate  for  8.  This  finishes  the  proof  of  part  (ii) 
of  the  main  theorem. 


2.4.  The  Scalar  Case 

The  unidimensional  version  of  the  problem  stated  in  Section  2.2 
will  be  treated  in  this  section.  This  scalar  case  admits  an  explicit 
solution  and  in  obtaining  it  insight  is  gained  into  the  structure  of  the 
solution  to  the  vector  case.  We  will  state  the  problem  and  study  its 
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2 

solution  in  what  follows.  Let  z = 8 +v  where  v~  N[0,o  ] and  8 is  an 

2 2 2 

unknown  element  of  fi  defined  by  ft  = [8  €E*  : 8 < e , e > 0}  . We  wish 

to  aetermine  a linear  minimax  estimator  6(z)  with  respect  to  the  loss 
function: 

L[5(z),8]  = c(6  - 8)2  - d82  (2.4.1) 


where  c > 0 and  d > 0.  For  this  loss  function  the  conditional  risk  function 
for  linear  decision  rules  of  the  form  6 = a2  is  given  by 

r[6 ,8*J  = [c(a-l)2  -d]82  + ca2a2.  (2.4.2) 

There  are  two  cases  to  consider  depending  on  the  relative  magnitudes  of  c 
and  d. 

Case  1:  c-d  < 0 

For  this  case  an  appropriate  prior  distribution  for  8 is  X*(8)  such  that 
8 = 0 with  probability  one.  The  average  conditional  risk  with  respect  to 

/ * t c 

J.  S 

^*{*[6,8]}  = cs~c?.  (2.4.3) 

A Bayes  rule  is  one  that  minimizes  (2.4.3)  which  turns  to  be  positive 
except  when  a*0  so  that 

6*(z)  = 0 (2.4.4) 

is  the  required  Bayes  rule.  Using  the  rule  defined  by  (2.4.4)  in  (2.4.2)  the 
conditional  risk  function  results 


r[5,8]  = [c  - d]82 


(2.4.5) 


.since  c-d  <0  r = 0 occurs  when  9 = 0.  This  is  the  point  that  receives 
probability  one  from  \*  and  therefore  o*(z)  * 0 is  minimax  according  to 
Lemma  2 in  Section  1.4. 

Case  2:  c-d  > 0 


e > c (Vj-  1), 


(2.4.6) 


then  >.*  assigning  equal  probability  to 


= + q = + a (J~%  - 


(2.4.7) 


is  an  appropriate  prior  distribution.  With  this  prior  distribution  the 
average  conditional  risk  is 


E.  *{ r[6 ,9  j}  = [c(a-l)2  - d]q2  + ca2cr2. 


(2.4.8) 


by  elementary  calculus  the  value  of  a that  minimizes  (2.4.8)  is 


2 2 
q +c 


and  t.hi:  corresponding  Bayes  rule  results 


= a'^z . 


(2.4.9) 


(2.4.10) 


; sr  of  this  rule  in  the  conditional  risk  yields  a constant  value  independent 


i.f  the  value  of  9 in  i.c.. 
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This  particular  situation  characterizes  6*  as  an  equalizer  rule,  i.e.  one 
that  produces  a constant  conditional  risk.  In  this  case  is  identified 
with  the  whole  parameter  space  0,  the  condition  of  Lemma  2 Section  1.4  that 
X*[t  *]  be  one  is  thus  trivially  satisfied.  We  note  that  all  the  conditions 
of  Lemma  2 have  been  established  and  we  conclude  that  in  this  case  2 under 
the  condition  defined  in  (2.4.6)  the  decision  rule  defined  by  (2.4.7), 
(2.4.9),  and  (2.4.10)  in  a minimax  rule. 

On  the  other  hand,  if  condition  (2.4.6)  is  not  satisfied  but 

e2<cr2(/^-l)  (2.4.11) 

we  define  a prior  probability  distribution  for  0 by  letting  X*  assign  equal 
probabilities  to  e =+e,  that  is: 

Pr{0  « e]  * Pr{0  = -e}  * j . (2.4.13) 

With  this  probability  distribution  the  average  conditional  risk  becomes 

E^*[r[i,6]}  = [c(a-ir  - d]e2  + ca2c2.  (2.4.13) 


By  reasoning  similar  to  that  which  lead  to  (2.4.9),  the  value  of  the 
multiplier  a*  defining  the  Bayes  rule  for  this  case  is 


a 


* 


2 

e 


2 

e 


(2.4.14) 


and  the  Bayes  rule  is 


c* 

o 


- 

a z 


(2.4.15) 


The  conditional  risk  function  for  this  Bayes  rule  is 
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r[8*,0]  = [c(a*-l)2 -die2  + c(a*)V. 


(2.4.16) 


The  conditional  risk  is  quadratic  in  6 and  the  coefficient  of  the  quadratic 
term  is  positive.  This  assertion  follows  from  equation  (2.4.14)  defining  a* 
and  from  condition  (2.4.11).  The  maximum  value  of  the  conditional  risk 
occurs  then  at  the  extreme  points  of  the  parameter  interval,  i.e. 

* { -e,e} . (2.4.17) 


Obviously  )»*[t  = 1,  and  according  to  Lemma  2 of  Section  1.4 

2 


2 . 2 
e +cr 


is  a linear  minimax  estimate  for  6. 


2.5.  Remarks 

The  main  theorem  in  Section  2.3  provides  us  with  a linear  minimax 
estimate  of  the  unknown  parameter  9 with  respect  to  the  generalized  quadratic 
loss  function  defined  in  Section  2.2  where  the  parameter  space  is  defined  to 
be  a hyperellipsoid  Tt  subset  of  En.  The  least  favorable  prior  distribution 
which  we  associated  with  this  minimax  estimate  is  a discrete  probability 
distribution  that  has  support  on  the  boundary  of  a hyperellipsoidal  subset  of 
En,  which  we  denoted  by  0*,  contained  in  the  original  hyperellipsoid  Q.  The 
minimax  estimator,  the  least  favorable  prior  distribution,  and  the  hyper- 
ellipsoid 0*  are  all  determined  by  the  solution  to  the  algebraic  matrix 
equations  stated  in  part  (i)  of  the  theorem  of  Section  2.3. 


jp  ' \ ' ~ 

f fe 

k m mm 

If  I 


When  the  restriction  imposed  on  the  parameter  that  it  reside  in 
this  hyperellipsoid  is  lifted,  Basar  and  Mintz  [ 2]  showed  that  the  minimax 
estimate,  over  linear  and  nonlinear  rules,  is  a linear  estimate  and  the  least 
favorable  prior  distribution  is  a zero-mean  normal  distribution.  A closely 
related  problem  solved  by  Mintz  [3]  in  which  the  loss  function  is  the 
standard  quadratic  loss  function  and  the  parameter  space  is  a hyper- 
ellipsoidal  subset  of  En  exhibits  a similar  structure  in  the  least  favorable 
prior  distribution  to  the  problem  treated  here,  with  the  difference  that  in 
that  case  the  support  of  the  discrete  prior  distribution  lies  always  on  the 
boundary  of  the  parameter  space.  The  class  of  decision  rules  considered  in 
(3]  is  also  restricted  to  linear  rules. 


2.6.  A Generalized  Model 

We  consider  now  a more  general  observation  model.  Let  z = x+0  + v, 
where  x and  v are  independent  zero  mean  Gaussian  random  vectors  with  positive 
deficit-  covariance,  matrices  Q and  F respectively.  The  results  obtained  in 
tic n 2.3  can  be  extended  to  find  a linear  minimax  estimate  for  u ^ x + 0 
under  a generalized  quadratic  loss  function  given  by 


L[6(z),u]  = (6-u) *c(6-u)  - 0 'Dj0 . 


n 2 2 

The  unknown  parameter  0 is  an  element  of  Q = {8  £ F<  : G^©  < e , e > Oj . 
The  fc Hewing  theorem  provides  the  desired  minimax  ortimate. 

Theorem  II: 


Dc  f i ne 


r (A, or)  - (A  + q+f)"1fcf(A+q+f3“1>d1  • <*d2. 


(2.6.1  ) 
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m 

Let  A*  and 

o'*  denote  any  solution  to 

r (A*,a*)A*  = 0 

(2.6.2) 

I 

r (A*, a*)  < 0 

(2.6.3) 

I 

Tr(D2A*)  < e2 

(2.6.4) 

I 

where 

A*  > 0 and  a*  > 0. 

(2.6.5) 

T 

With  this 

definition  and  notation  we  conclude  that: 

(i)  There  exists  a pair  A*  and  a*  satisfying 

(2.6.2)  through 

I 

(2.6.5),  and  either 

i 

a*  = 0 and  Tr[D0A*]  = e2  < e2 

2 * — 

*w 

or 

a*  > 0 and  Tr[D2A*]  = e2  = e2. 

3 

afts 

1 

| 

(ii)  The  decision  rule  6*  given  by 

6*  (z)  = (A*  + Q)  (A*  + Q +E)-1z 

(2.6.6) 

is  a linear  minimax  estimate  for  u.  The  details  of  the  proof  are  omitted 
1 ere  since  it  is  essentially  the  same  as  the  proof  of  Theorem  1 of  Section 
2.3. 

A further  extension  of  the  work  presented  in  this  thesis  can  be 


used  to  determine  a linear  minimax  terminal  state  estimate  for  a linear 
plant  subject  to  an  energy  constraint  with  a generalized  quadratic  loss 
function  as  the  underlying  loss  function. 
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